Support of strides in the convolutional layers #239
base: main
Conversation
Yes, it's exactly how I would do it, thanks Jeremie.
I think it is a good implementation. Thanks for your work.
I'd like to offer my support whenever it is needed, feel free to contact me. Right now I'm too busy to develop improvements on my own, but I can cooperate with someone else.
self % gradient(k,iws:iwe,jws:jwe) = self % gradient(k,iws:iwe,jws:jwe) &
  + gdz(n,iws:iwe,jws:jwe) * self % kernel(n,k,1:iwe-iws+1,1:jwe-jws+1)
@Riccardo231 Could you check this, please? I think it has a different behaviour now. However, I am not sure what the goal was before the change, because not all entries of self % gradient were updated (only the entries between istart:iend and jstart:jend were).
I think I implemented the conv2d variant. I'll look at this carefully over the weekend. It's possible that the original code was bad.
I'll have a look tomorrow.
Sorry, just to let you know that I'm really busy with school these days. I can probably have a look after the 5th. Sorry for the delay.
No worries. Whenever you have time. Thank you.
Hello, sadly I have been away more days than expected. However, I am now ready to commit to the project. Do I still need to take a look, or has it already been done? Thank you and sorry for the delay.
@Riccardo231, no problem. If you have time, it would be nice if you could take a look at this PR.
Hello, I just had a look and ran a test with cnn_mnist.f90 without modifying any parameters; the old and new versions both converge to 80% accuracy within 10 epochs. Your changes look good to me, but I didn't take a deep dive. Feel free to let me know whenever anything else is needed.
Hi all, I appreciate your patience with this.
Based on my comment ! dL/dx = dL/dy * sigma'(z) .inner. w, and assuming it is correct given my understanding of how the backward pass of convolutional layers works, this should be an inner product (a sum of element-wise products), rather than element-wise products added element-wise to the gradient.
It is true that the original implementation left the edges of the gradient unchanged, but I think this is merely a consequence of summing over the kernel width down to a scalar result.
So I think the previous code was correct, meaning that we need the full inner product (including the sum that produces the scalar result), not just an element-wise product and assignment.
If I'm correct, it means that Riccardo's conv1d backward pass should be updated to reflect this.
Let me know what you think.
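Written out explicitly (my reading of the comment above, reusing the names from the snippet under discussion, with iws:iwe and jws:jwe denoting the input window touched by the current output element), the inner-product update for a single gradient entry would be something like:

$$
\frac{\partial L}{\partial x_{k,i,j}} \mathrel{+}= \sum_{p=1}^{i_{we}-i_{ws}+1} \sum_{q=1}^{j_{we}-j_{ws}+1} \mathrm{gdz}(n,\; i_{ws}+p-1,\; j_{ws}+q-1) \cdot w(n,k,p,q)
$$

That is, the window of element-wise products is reduced to a scalar before being added to a single entry of self % gradient, instead of a whole window of gradient entries being updated at once.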
I think you are right. So, should it be something like:
self % gradient(k,iws:iwe,jws:jwe) = self % gradient(k,iws:iwe,jws:jwe) &
  + gdz(n,iws:iwe,jws:jwe) * self % kernel(n,k,1:iwe-iws+1,1:jwe-jws+1)

becomes

self % gradient(k,i,j) = self % gradient(k,i,j) &
  + sum(gdz(n,iws:iwe,jws:jwe) * self % kernel(n,k,1:iwe-iws+1,1:jwe-jws+1))
If correct, conv1d must be revised too, as well as locally_connected.
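To make the difference concrete, here is a small self-contained Fortran sketch (toy arrays and names only, not neural-fortran code): the element-wise form adds a whole window of products to a window of gradient entries, while the sum form reduces the same products to one scalar added to a single entry.

program inner_product_demo
  ! Toy illustration only: gdz_win stands in for gdz(n, iws:iwe, jws:jwe)
  ! and kernel_win for self % kernel(n, k, 1:iwe-iws+1, 1:jwe-jws+1).
  implicit none
  real :: gdz_win(3,3), kernel_win(3,3)
  real :: grad_window(3,3), grad_entry

  gdz_win = 1.0
  kernel_win = 2.0
  grad_window = 0.0
  grad_entry = 0.0

  ! Element-wise update: a 3x3 block of products is added to a 3x3 slice
  ! of the gradient (what the current PR lines do).
  grad_window = grad_window + gdz_win * kernel_win

  ! Inner-product update: the same 3x3 products are reduced to a scalar
  ! and added to a single gradient entry (what the suggestion above does).
  grad_entry = grad_entry + sum(gdz_win * kernel_win)

  print *, 'element-wise, one entry of the window:', grad_window(1,1)  ! 2.0
  print *, 'inner product, single entry:          ', grad_entry        ! 18.0
end program inner_product_demo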
@milancurcic @Riccardo231 Pending a comment/question, this PR is ready for review and/or to be merged.
Proposal to support strides in the convolutional layers
@Riccardo231 @milancurcic does this approach make sense? If so, I will continue with the implementation.
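For reference, a minimal sketch of the window indexing I assume stride support would use (illustrative only; the names and the (i-1)*stride + 1 mapping are my assumptions, not necessarily what this PR implements):

program stride_window_demo
  ! Illustrative sketch: for a 1-d convolution with kernel width kw and
  ! stride s (valid padding), output element i reads the input window
  ! (i-1)*s + 1 : (i-1)*s + kw.
  implicit none
  integer, parameter :: input_width = 10, kw = 3, stride = 2
  integer :: i, iws, iwe, output_width

  output_width = (input_width - kw) / stride + 1   ! = 4 here

  do i = 1, output_width
    iws = (i - 1) * stride + 1   ! window start in the input
    iwe = iws + kw - 1           ! window end in the input
    print '(a, i0, a, i0, a, i0)', 'output ', i, ' <- input ', iws, ':', iwe
  end do
end program stride_window_demo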